ABSTRACT


Cybercrime is becoming ever more serious. Findings from the 2002 Computer Crime and Security Survey show an upward trend that demonstrates a need for a timely 
review of existing approaches to fighting this new phenomenon in the information age. In this paper, we define different types of cybercrime and review previous 
research and current status of fighting cybercrime in different countries that rely on legal, organizational, and technological approaches. We focus on a case study of 
fighting cybercrime in India and discuss problems faced. Finally, we propose several recommendations to advance the work of fighting cybercrime. 
INTRODUCTION:

Cybercrime is criminal activity done using computers and the Internet. This 
includes anything from downloading illegal music files to stealing millions of 
dollars from online bank accounts. Cybercrime also includes non-monetary 
offenses, such as creating and distributing viruses on other computers or posting 
confidential business information on the Internet. 


Reasons for Cybercrime: 

1. Capacity to store data in comparatively small space: The computer has the
unique characteristic of storing data in a very small space. This makes it
much easier to remove or derive information through either a physical or a
virtual medium.

2. Easy to access: The problem in guarding a computer system from
unauthorized access is that a breach is always possible, not because of
human error but because of the complexity of the technology. Secretly
implanted logic bombs, key loggers that can steal access codes, and advanced
voice recorders, retina imagers, etc. that can fool biometric systems and
bypass firewalls can be used to get past many security systems.

3. Complex: Computers run on operating systems, and these operating systems
in turn are composed of millions of lines of code. The human mind is
fallible, and it is not possible that there will be no lapse at any stage.
Cyber criminals take advantage of these lacunae to penetrate the computer
system.

4. Negligence: Negligence is very closely connected with human conduct. It is
therefore quite probable that some negligence occurs while protecting a
computer system, which in turn gives a cyber criminal the opportunity to
gain access to and control over the computer system.

5. Loss of evidence: Loss of evidence is a very common and obvious problem,
as data are routinely destroyed. Further, the collection of data outside the
territorial extent also paralyses this system of crime investigation.

6. Cybercriminals have embedded malware into legitimate applications, and
they target poorly secured Wi-Fi spots, steal passwords, and more in their
quest to steal information.

7. Attackers like to exploit unauthorized products with weak security
controls in the corporate cloud.

8. Zombie servers that are left unattended or are not updated offer
additional ways to access networks.

9. Known vulnerabilities are not patched in time: a study has found that it
takes an average of 193 days before a patch gets applied to fix a problem,
even though the patch is available. That is an easy place for exploiters to
get in and do their damage.


RELATED WORK: 

It is important to read previous related work both to learn from the experience
of others and to add something to our existing body of knowledge. Existing
literature has been reviewed in three different areas:


Crime data mining, data extraction and data focus: 
Cyber Crime Data Mining: Data mining is defined as the discovery of interesting
structure in data, where structure designates patterns, statistical or predictive
models of the data, and relationships among parts of the data. Data mining
techniques have produced some results in crime mining. These techniques are
applied to study cyber crime cases, mainly through entity extraction, pattern
clustering, classification and social network analysis.


Data Extraction Event: Web page extraction is the process of extracting
attributes and relationships. The idea of event extraction is to retrieve
information from a database. A method has been proposed to append events for
use with data mining techniques. Entity extraction has been used to
automatically identify entities such as person and address.


Data Focus: This research concerns web content mining; using clustering
techniques, the web mining focuses on the text. In data focus, clustering
converts nonlinear statistical relationships between high-dimensional data into
simple geometrical relationships on a low-dimensional display.


METHODOLOGY: 

Data Collection: The data set is a collection of fields taken from web pages on
the internet. It consists of the text from web pages; pictures, videos and
sound are ignored.
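
The collection step can be illustrated with a minimal sketch that keeps only the visible text of a page. It assumes the pages are plain HTML and uses Python's standard html.parser module; the class name TextOnlyParser, the function page_text, the list of skipped tags and the sample page are all hypothetical and are not taken from the system described here.

```python
from html.parser import HTMLParser


class TextOnlyParser(HTMLParser):
    """Collect visible text; ignore markup, scripts, styles and media tags."""

    # Text nested inside these tags is not treated as page text (assumed list).
    SKIPPED = {"script", "style", "video", "audio"}

    def __init__(self):
        super().__init__()
        self.chunks = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self.skip_depth == 0:
            self.chunks.append(text)


def page_text(html):
    """Return the text content of an HTML page as one string."""
    parser = TextOnlyParser()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Made-up page: the image and video are dropped, the text is kept.
sample = (
    "<html><body><h1>Judgment</h1>"
    "<img src='photo.jpg'><video><source src='clip.mp4'>fallback</video>"
    "<p>Case no 0727 ABC is sentenced to jail.</p></body></html>"
)
print(page_text(sample))
```

Images carry no text of their own, so they drop out without special handling; only tags that can wrap fallback text need to be skipped explicitly.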


Preprocessing: Data preprocessing is a process that consists of data cleaning,
data integration and data transformation, and is usually performed by a
computer program. It is intended to reduce noisy, incomplete and inconsistent
data. The results of the preprocessing step can later be processed by a data
mining algorithm. Clustering techniques are used to fetch information about
criminals.
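
A minimal sketch of the three preprocessing stages follows. The function names clean, integrate and transform, the rules inside them and the sample records are illustrative assumptions; the actual cleaning rules of the system are not specified in this paper.

```python
import re


def clean(text):
    """Data cleaning: drop leftover tags and collapse whitespace."""
    text = re.sub(r"<[^>]+>", " ", text)
    return re.sub(r"\s+", " ", text).strip()


def integrate(records):
    """Data integration: merge sources, dropping empty or duplicate records."""
    seen, merged = set(), []
    for record in records:
        key = record.lower()
        if record and key not in seen:
            seen.add(key)
            merged.append(record)
    return merged


def transform(text):
    """Data transformation: lower-case word tokens for later mining."""
    return re.findall(r"[a-z0-9]+", text.lower())


# Made-up input: one noisy record, one duplicate of it, one empty record.
raw = [
    "  Case no 0727   ABC is <b>sentenced</b> to jail ",
    "case no 0727 ABC is sentenced to jail",
    "",
]
cleaned = integrate([clean(r) for r in raw])
tokens = [transform(r) for r in cleaned]
print(cleaned)
print(tokens)
```

Here the duplicate and the empty record are removed during integration, leaving a single cleaned record and its token list for the mining step.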


Data Mining: Data mining deals with the discovery of unexpected patterns and
new rules that are “hidden” in large databases. In this paper, data mining is
used to derive structured data from the unstructured data of a judge's
judgment. Data mining techniques are applied to crime in two directions:


1. Classification of Cyber Crime 


2. Clustering Technique of Cyber Crime 


1. Classification of Cyber Crime: Crime is defined as “an act or the
commission of an act that is forbidden, or the omission of a duty that is
commanded by a public law and that makes the offender liable to punishment
by that law”. Crime is a comprehensive concept that is defined in both a
legal and a non-legal sense.


2. Clustering Technique of Cyber Crime: Data clustering is the process of
putting similar data into groups. A clustering algorithm partitions a data
set into several groups such that the similarity within a group is larger
than the similarity among groups. Clustering can also be considered the most
important unsupervised learning technique; like every other problem of this
kind, it deals with finding a structure in a collection of unlabeled data.
Many techniques are used in clustering, including the following.


K-Means Clustering Algorithm: The K-means algorithm is mainly used to
partition data into clusters based on their means. The algorithm clusters
observations into K groups, where K is provided as an input parameter. It
assigns each observation to a cluster based on the observation's proximity
to the mean of that cluster. Each cluster's mean is then recomputed and the
process begins again. In this paper, the K-means algorithm is used in the
process of getting structured data from unstructured data. The working of
the algorithm is explained as follows:

k : pre-determined number of clusters 

Algorithm (Step 0: determine value of k)
Step 1: Randomly generate k points as initial cluster centers
Step 2: Assign each point to the nearest cluster center
Step 3: Re-compute the new cluster centers

Repetition step: Repeat steps 2 and 3 until some convergence criterion is met
(usually that the assignment of points to clusters becomes stable).
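
The steps above can be sketched in plain Python as follows. This is an illustrative sketch, not the implementation used in the system: it assumes numeric feature vectors and squared Euclidean distance, and it picks the initial centers from the data points themselves (a common variant of Step 1). The function names, the seed parameter and the sample points are hypothetical.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(axis) / len(points) for axis in zip(*points))


def k_means(points, k, max_iter=100, seed=0):
    rng = random.Random(seed)
    # Step 1: pick k distinct data points as initial centers (assumption).
    centers = rng.sample(points, k)
    assignment = None
    for _ in range(max_iter):
        # Step 2: assign each point to its nearest cluster center.
        new_assignment = [
            min(range(k), key=lambda c: squared_distance(p, centers[c]))
            for p in points
        ]
        # Convergence: stop once no point changes cluster.
        if new_assignment == assignment:
            break
        assignment = new_assignment
        # Step 3: recompute each center as the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # keep the old center if a cluster becomes empty
                centers[c] = mean(members)
    return centers, assignment


# Made-up 2-D points forming two well-separated groups.
data = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centers, labels = k_means(data, k=2)
print(centers, labels)
```

The cluster numbering depends on the random initial centers, so only the grouping of the points, not the label values, should be relied upon.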


RESULT: 
The result of this crime management system is given below: Judgment of a crime


Table 1: Unstructured data 
Case no 0727 ABC is sentenced to jail for 15 years in the Raipur for the Cheating 
crime of a person using Computer 


Table 2: Structured data

| Case no | Name | Crime Type | Judgment | Location |

In this paper, the K-means clustering algorithm is used to derive structured
data from the unstructured data in the database. The information retrieved
from the database is clear and understandable to the user of the system: the
name, case, judgment, etc. are extracted from the judge's text. The judge gives
the unstructured information about a criminal as a judgment. The prison
administrator gets the data on a particular criminal in a structured format.
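
To make the target format concrete, the sketch below maps the sample judgment of Table 1 onto the column names of Table 2. The paper attributes this step to K-means; the sketch swaps in a plain regular expression purely as an illustration. The pattern, the function name structure_judgment and the assignment of phrases to fields are assumptions fitted to this single example.

```python
import re

# Hypothetical pattern fitted to the single sample judgment in Table 1.
JUDGMENT_PATTERN = re.compile(
    r"Case no (?P<case_no>\d+) (?P<name>\w+) is sentenced to "
    r"(?P<judgment>jail for \d+ years) in (?:the )?(?P<location>\w+) "
    r"for the (?P<crime_type>\w+) crime"
)


def structure_judgment(text):
    """Return the Table 2 fields as a dict, or None if the text does not match."""
    normalized = " ".join(text.split())  # undo line wrapping
    match = JUDGMENT_PATTERN.search(normalized)
    return match.groupdict() if match else None


judgment = (
    "Case no 0727 ABC is sentenced to jail for 15 years in the Raipur "
    "for the Cheating crime of a person using Computer"
)
record = structure_judgment(judgment)
print(record)
```

A fixed pattern like this only covers judgments phrased exactly like the sample; differently worded judgments would return None and need another approach.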


CONCLUSION: 

Cyber crime data is a sensitive domain where effective data mining techniques
play a vital role in cyber crime analysis. In this paper, classification and
clustering techniques are used to analyse the crime data from a database. This
approach makes it faster to get data through the web; effective web mining
converts unstructured data into structured data. Crime types are classified as
Cheating, Forgery, etc., and crimes are clustered using k-means to retrieve the
data.